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Abstract. Searches for gravitational wave bursts that are triggered by the 
observation of astronomical events require a different mode of analysis than all- 
sky, blind searches. For one, much more prior information is usually available in 
a triggered search which can and should be used in the analysis. Second, since 
the data volume is usually small in a triggered search, it is also possible to use 
computationally more expensive algorithms for tasks such as data pre-processing 
that can consume significant computing resources in a high data-volume un- 
triggered search. From the statistical point of view, the reduction in the parameter 
space search volume leads to higher sensitivity than an un-triggered search. We 
describe here a data analysis pipeline for triggered searches, called RIDGE, and 
present preliminary results for simulated noise and signals. 



1. Introduction 

Gravitational wave (GW) searches that are triggered by the observation of an 
astronomical event have some aspects that are fundamentally different from all-sky 
or un-triggered searches. First of all, the non-GW observations furnish a wealth of 
prior information that can guide the GW data analysis. For instance, in the case of a 
Gamma-ray Burst (GRB) trigger, we usually know the sky location and, sometimes, 
the distance to the source. Based on the type of GRB, long-soft or short-hard, one can 
look for qualitatively different signals: short duration bursts for the former and binary 
inspiral signals for the latter. Second, there are many data pre-processing algorithms 
available in the literature that have been demonstrated to be useful but have not been 
used in untriggered searches because of a natural selection effect; finite computational 
resources pick the computationally least expensive algorithm. Triggered searches, on 
the other hand, usually involve much smaller data volumes and, consequently, allow 
us the freedom to experiment with algorithms. 

In this paper, we describe results from one such ongoing experiment of using new 
algorithms for analyzing data from the LIGO, GEO and Virgo detectors for triggered 
searches. We use new algorithms for both the data pre-processing (conditioning) and 
the search part of the analysis pipeline. The code for the pipeline, called RIDGE, is 
written mostly in matlab. We will not describe the code in any detail in this paper 
but focus on the algorithms and results. 
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2. Data analysis components 

A typical triggered search involves several components [1]. First, a mechanism is 
required to collect trigger information. This is already well developed and nearly 
automated within the LIGO Scientific Collaboration (LSC) , at least as far as GCN [2] 
triggers are concerned. Second, given the wealth of astrophysical prior information 
associated with most triggers, we need proper mathematical tools to incorporate this 
information into our analysis. So far, this aspect of a triggered search has seen only 
limited advances. The most detailed use of prior information has been in the search [3] 
for GWs from the SGR- 1806-20 event [4] where X-ray intensity oscillations associated 
with the flare were used to motivate a search for quasi-monochromatic signals at 
specific frequencies. However, more work is needed in this area, especially on the use 
of Baycsian approaches modeled after the ones used for a different type of "triggered" 
search [5], namely GWs from known pulsars in the Galaxy. The other components 
required in a typical triggered search, as in any other search, are data conditioning 
and the generation of a detection statistic. 

2.1. Data conditioning in RIDGE 

The purpose of data conditioning is to mitigate the effect of known instrumental 
artifacts in the data. In RIDGE this is done in a two step process. First, the noise floor 
of the power spectral density (PSD) of noise is estimated using a running median [6]. 
Fig. 1 shows the PSD of 2 sec of simulated detector noise with a power spectral 
density approximating the LIGO SRD curve for the 4km detectors. Superimposed 
on the estimated PSD is the nmning median estimate of the noise floor. The main 
advantage of the running median is its ability to reject outliers. In this particular case, 
the outliers are the narrowband noise artifacts arising from sources such as power line 
interference (modeled as sinusoids in this particular example). 

If S{f) denotes the running median estimate of the PSD, a digital filter with a 
transfer function T(/) such that |T(/)| = \/^JS{f) will whiten the data. Fig. 1 also 
shows the PSD of the output of such a digital filter. In RIDGE, the whitening filters 
are Finite Impulse Response (FIR) filters and the filter coefficients are obtained using 
the PSD obtained from a user-specified training data segment. The digital filter is 
then applied to a longer stretch of data. It is clear that any non-stationarity in the 
noise floor on timescales shorter than the gap between the training and the processed 
data can create a discrepancy in the trained filter and the filter that is required for 
whitening the longer stretch. To the knowledge of the authors, not much concrete work 
exists in the GW literature on handling this issue robustly in any analysis pipeline. 
However, tools already exist that can monitor the behaviour of the noise floor in real 
time [7], and characterize burst-like non-stationary components [8]. We intend to 
integrate these information into our pipeline in the future. 

The whitening step above is followed by a line estimation and removal step which 
reduces or, in some cases, eliminates the effect of strong narrowband noise features 
{line noise) arising from power line interference or violin modes. The method used 
here, called MBLT, is described in [6]. Essentially, the method consists of estimating 
the amplitude and phase modulation of a line feature at a given carrier frequency 
(which have to be found manually). Fig. 2 shows spectrograms of the whitened data 
before and after line removal. In this example, signals were injected into the data 
prior to line removal. As can be seen, the line removal does not affect the signals in 
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Figure 1. A sample power spectral density and the corresponding running 
median estimate of tiic noise (top plot). Note that the noise floor estimation 
was not affected by the large outlier due to a strong sinusoidal signal. The plot 
at the bottom shows the psd of raw data (top) and the psd (bottom) of the data 
after passing it through the whitening filter. The whitened data has variance 1, 
which explains the change in level to dB in the bottom plot. 



any significant way. This is an inbuilt feature of MBLT which uses the running median 

for estimation of the hne amplitude and phase functions. Transients signals appear as 
outliers in the amplitude/phase time series and are rejected by the running median 
estimate. The whitened and line removed data is then passed on to the next step of 
RIDGE. 

2.2. Detection statistic 

The basic algorithm implemented for coherent network analysis in RIDGE is regularized 
maximum likelihood, described in several recent papers [9, 10, 11]. The GW response 
of any one detector is a linear combination of the two unknown polarization waveforms 
h+{t) and hx (t) arriving at the detector from a certain direction 9o, (po on the sky. We 
assume the Earth-centered, ecliptic reference frame here for defining the polar angle, 
9, and azimuthal angle, (j). It was shown in [9] that the problem of inverting a set 
of detector responses to obtain h+{t) and hx{t) is an ill-posed one and needs to be 
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Figure 2. Spectrograms of simulated data with (top) line present in the data 
along with two transient signals and (bottom) same data with the line removed 
using MBLT. This simulation actually consisted of more lines but due to the 
limitations of the graphics used, we show only a zoomed in view of a particular 
region of the time-frequency plane. The simulated line feature appears as the 
straight horizontal line in the upper plot near 350 Hz; the feature actually consists 
of two closely spaced sinusoids at 344 and 349 Hz. 



solved by using some kind of regularization. Alternatives to the regularization used 
in [9] were proposed in [10, 11]. The theoretical aspects of regularization related to 
the direction dependent antenna patterns of the detectors in a network is fairly well 
understood now. 

The particular regularization scheme used to obtain the results reported here was 
developed in [10]. It is quite easily shown that the hard constraint algorithm presented 
in [9] is actually obtained in the limit of an infinitely large gain factor for the regulator 
of [10]. The input to the algorithm is a set of equal length, conditioned data segments 
from the detectors in a given network. The output, for a given sky location 9 and 
<j), is the value of the likelihood of the data maximized over all possible and /ix 
waveforms with durations less than or equal to the data segments. The maximum 
likelihood values are obtained as a function of 6 and (p - this two dimensional output, 
S(6', (/)), is called a sky-map. 

Fig. 3 shows examples of sky-maps obtained for the network consisting of the two 
4km LIGO interferometers and the 2km LIGO interferometer. As can be seen from 
the figure, the presence of a signal can "ring off' the entire sky. As the direction to a 
trigger will usually be well known in advance from electromagnetic observations, say, 
there is no necessity in a triggered search to restrict ourselves only to a part of the 
sky-map for the purpose of detection. We can construct a detection statistic using the 
entire sky map. In the present paper, we consider two different choices. The obvious 
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Figure 3. Plots of the sky-map for noise only case (top) and signal plus noise 
case (bottom). The detector network used was HI, H2, LI and VI. The top plot 
was made by averaging the sky-maps for 100 noise realizations. The bottom plot 
is an average over sky-maps made with 4 independent realizations of noise plus 
a signal with /irss = 5 x lO^^z Hz-^/^ ^c.f., Eq. 4). The magnitude scale is in 
arbitrary units. The test statistics are constructed by normalizing the maps. 
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So{0, ^i") = E [S{e, (l))\no signal in data] 

where E[S(0, 4>)\no signal in data] denotes an ensemble average taken over realizations 
of sky-maps that do not have any GW signals. The second statistic, i?radi is the radial 
distance of the observed values, in an appropriately scaled {Rmm, max^^^ S(^, ^)) 
plane, from the mean location of the same quantities in the absence of a signal. 

3. Results 



We carried out Monte Carlo simulations to characterize the performance of the 
detection statistics Rmm and Rrad- The network consisted of the 4km LIGO detectors 
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Figure 4. Power Spectral Densities of simulated detector noise for HI (LI), H2 
and VI. The power spectral densities were computed with a frequency resolution 
of 2.0 Hz. 



(HI and LI), the 2km LIGO detector (H2) and the Virgo detector (VI). For the 
detector noise PSD, we used the design sensitivity curves for the LIGO and Virgo 
detectors as given in [12, 13] and kept the locations and orientations the same 
as the real detectors. Gaussian, stationary noise was generated 3000 sec) by 
first generating 4 independent realizations of white noise and then passing them 
through FIR filters having transfer functions that approximately match the design 
curves. To simulate instrumental artifacts, we added sinusoids with large amplitudes 
at (54,60, 120, 180,344,349,407) Hz. The resulting PSDs of the simulated data are 
shown in Fig. 4. 

Signals of constant amplitude were added to the simulated noise at regular 
intervals. The injected signals corresponded to a single source located ai = —30° 
and (t> — 167.0°, which was the location of a real trigger, GRB060813. We assumed 
that the h^(t) and hxit) waveforms were both sine-Gaussian signals having center 
frequency 235 Hz and a Q = 9. The h^{t) waveform has an added phase of 7r/2 
relative to the phase of h^{t). Fig. 5 shows the signals, i.e., the detector responses, 
that were added to the noise. The signal strength is specified in terms of the /irss 
value, defined as 



In our results we use h^ss values of 3 x 10^^^, 5 x 10^^^ and 7 x 10~^^ Hz~ ' . 

The simulated data were generated at a sampling frequency of 16384 Hz and 
then passed through the data conditioning pipeline. Besides downsampling the data 
by applying the same anti-aliasing filter to all data streams, the data conditioning 
pipeline applies time domain whitening filters that were trained, as described earlier, 
on the first two seconds of data for each detector (without any injected signals). The 
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Figure 5. The detector responses to the injection signal used in the Monte Carlo 
simulations. The strain response for HI and H2 arc identical because of their 
co-location and co-alignment. 



conditioned data is then passed to the sky-map generation code in segments of length 
0.5 sec. This integration time is much larger than the ^ 40 msec dm^ation of the 
injected signals and, hence, the performance estimates we obtain here should be taken 
as lower bounds. The reason behind the large choice of 0.5 sec was to accomodate the 
filter delays introduced into the data. In a future version of the pipeline, these delays, 
which are known since the filters are all FIR filters, will be removed carefully. This 
will allow us to shrink the integration time for short duration signals. 

Fig. 6 shows the Receiver Operating Characteristic (ROC) curve for the 
simulation described above. ROC curves for both Rmm and i?rad are shown. Clearly, 
the i?rad statistics performs better than the more naive choice Rmm, demonstrating 
that a judicious choice of a statistic that incorporates information from the full sky- 
map is definitely required for triggered searches. We believe that more effective 
full sky-map statistics can be derived that can push performance to even better 
levels. For the current simulation and choice of parameters, we find that an /ij-ss of 
5 X 10^^^ Hz^^^^ can be detected with ~ 62% probability at a false alarm probability 
of lO^'^. The false alarm probability corresponds to a rate of 1 false event in 50 sec. 

4. Conclusion 

We describe here an analysis pipeline, RIDGE, that implements regularized maximum 
likelihood based network analysis for triggered searches. Monte Carlo simulations with 
simulated detector noise show that the pipeline is quite sensitive: For a LIGO- Virgo 
network, a ~ 62% detection probability is obtained for a signal /irss = 5 x 10~^^ Hz~^^^ 
at a false alarm rate of 2 x 10^^ Hz. All the algorithms used in this pipeline are 
significantly different from other network analysis pipelines [14, 15] that are also 
currently being applied to triggered searches. (The basic statistic used in [15] is 
the hard constraint statistic introduced in [9].) Thus, at both the level of algorithms 
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Figure 6. Receiver Operating Characteristics for Rmm (dashed) and /Jjad (solid) 
for hrss = 3 X 10^22 (bottom), 5 X 10^22 (middle) and 7 X 10"^^ Hz^^/^ (^^p)^ 
The enhancement when going from Rmm to -Rrad is shown by the arrows. 



and the associated software, RIDGE should provide an independent check for results 
obtained with other pipelines. RIDGE is also being designed to serve as a platform for 
trying out new regulators. In future work, since RIDGE pipeline does not only extract 
candidates of gravitational wave signals, but also reconstruct h+ and hx waveform at 
arbitrary sky location, we integrate a waveform estimation method [16] and investigate 
how accurately astrophysical information can be obtained. 
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